# California

* There were two different kinds of audits conducted: a statewide hand count, and two county-level Risk Limiting Audits

* Data were obtained from the California Secretary of State at https://web.archive.org/web/20220829134421/https://www.sos.ca.gov/elections/post-election-audits/1percent-manual-tally/2020-general-election 

* The original `pdf` files are in the `original` folder

* The `pdf` files were transcribed to `csv` files using Able2Extract where possible. Those `csv` files are in the `transcribed` folder

* The files for Modoc, San Bernardino, San Joaquin, Santa Barbara, Santa Clara, and SolanoCounties were transcribed by hand by the research team due to the difficulty of automatically extracting them. These files are placed directly in the `ready` folder.

* For each county that could be automatically transcribed, running the corresponding `.py` file in the `src` folder generates a cleaned file in the `cleaned` folder
